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ABSTtiACl 4 ^ 

^ High school and college grades are suggested to 
the nature of ordinal scales rather than interval scales; , hence 
■edian is the appropriate and preferred average, rather than the 
•ean^' The use of a eedian grade point , average has scne of the 'aajor 
advantages gt pass^fail grading* issuning grades to be ordinal rather 
than interval data suggestls that estimates ojC teit bias should not be 
sub;|ected onlV to analysis oi covariance* is an alternative/ 
eguiperceatile eguatitfg procedures are suggested as a^ -lethod of 
studying bias that vould tend to be acre favorable fat lov-scoring 
groups* (Author) 
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ttatljoni Ca^es. for ^rdlnal Measurement 
Charles T, Myers ^ * ^ 

A/cording t^T^blre No, 76 of the Statistical Abstract of" thepynited 
States 1971 ttfere vere in this country during 1968 a total of 18,^51 deaths 
dtj^ tp'ag^dental falls.» Surely* there is some truth in the ruxrior that there 
ill afeon be a demand fpr Congress to act promptly t?o repeal the law of 



A; 



gBGvitationy^ No other law works so qpnstantly to limit movements and ta 

y / '/ ' ^ V 

diminish/freedom. Furthermore^ /this law Is obvioiisly biased against ob.ese' 
.peog^e — people vho already 'suffer fr'om our social prejudices^ . Rp jdtj>er — law 

been in effect for a lortger timetwithout -having been'revised^-to ieet^the 
needs of a changing society ^-j^^ If you detect in these^reoarks a nc^l ^^ ^ 
sarcasm, perhaps it will/ suggest' to you a certain .skepticism about demands 
for educational rafprm? '^^aedofa,' equality,^ and^rogr^ess are all .,gri^atly 
desired, but they ma^^ itt:j£a<r£^onf JLict with *each othe^ and not every demand 
or ei^ressed need w^uld in; fact, work to the^beneJit of society 1 . 

On -^the oJ^K^ryhandi' educarlpiial ^Sla^itutlons iiajr in fact suff6p from uadue 




dity -and tys 



^e fa^ltu 



Fure to examine the assumptions on which they were founded, 
p£*is the purpose of this paper to examine pgs&lble sources of bias in two of 
the vecurjentr topics of student dissatisfaction: the treatment of standard- 
ized test scores/ and tjie nature, of course ^rades^-in college and high school. 
It has bjSeiT coifimon with respect to both af these forms of measurement, to assume 

I J ■ ■ ■ ' 

that they are properly treated as interval scales. This paper will disctjiss 

I I * 
the alternative possibility that they should be treated as ordinal scales. 

7 7' ' ' ■ ' - ' 

fAxopnj^ the objections t^hat have been raised to conventional course grading 
systems/ perhaps ^he jibs t compelling ai^e thesw- that the nee^d to maintain a *^ 



( 



good gr,ade-point average inhibits students from taking .chances with difficult 
courses or courses in areas with' which they are unfamiliar, and that, simila,rly, 
the need to maintain a good grade-point average may corrode the relationship 
-between teacher and student. So much is thought to depend upon the academic 
record and on grade-point averages (which ai:e almost always the mean grade/ 
that there is strong mot^ivation for students tp play either a subservient 
role to that of the teacher or a competitive role with regard to op'heri 
students. * ' ; / - 



V 



However, this situation i^ altered substantially if g^ade/ are assun^ to 
be ordinal data rather than interval data, for in that c^se t?4e ^propriate 
average then becomes the median. In such a system a studeat need no logger 
fear a difficult or an unfami;.iar field, nor an unfair^e4ch*|' because the 
one lowest grade'out of four or five grades will Jiave'np effect 



.fe.ct yhatsoever 



on that average. Whfen grades are treated ^as ord:^al a^cales^ the students 

receive much of the benefiVtji^ "pass-fail" systems, / But ndte that pass-fall 

^ / 
grades may lower a student's average if he^fails. While they cantiot' rllse the 

average *lth a pass no matter how well the studenft hda peMormed. The 

"pas^jno- record" system is somewhat better than "pass-fail" In this regard, 

but it still lacks some of the vir^es'-o^ ordinal-scate grading. ^ ^ 

These comments on ordlnal-^cale grading are /Sty^tegic .rdtbfer ^han philo- ^ 

sophical. Qn that aspect , df the problem I vill /on^^ ask the cfuestlon, "how ' 



would one- establish the contention th^t 
scale ^ata, Vatt;y:ularly' in colle-ges/wh 



to have been ,$h&rply curtailed in dhe competitive admissions procjess'?^ The use 



of systems of grading baised on^ the assumpt 
cl^s cl€iprly doe3*not answer th^ problem. 




ifuld be treated as ^^terval- 
ity distribution is likely 



n#rmal curve^ withTiS each 
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Current theories of educati-onal and psychol6gical measurement have be^n 

'^it^^ In large measure from the Statistics of ganibliag* and anthropometric 

//oe^urement. In both these fields It is common for 'distributions of events- 

' ^ox of individuals to appear to approach nomiajit^y - /Test; statistics also 

f r^uently result ''in distributions that are ^^fyiiaietrical abd not dramatically ^ , 

nonnormal, although It is probabJLy true^d^t a niajorityof score distributions 

tend to be viois^ platykurtic thaa norjaal Perhaps because of this hlstor^ and ' 

perhaps because the theory of llo^^r regr^ess^on has been more thoroughly 

developed than other al^mati/^s,, most discussions and investigations of 

»^ests assume normality for ^l^oth univariate and multivariate^ distributions and 

: ' * 

linear relationships betscfeen all vai:iables. Significance test> for linear ityv 

4p not app^^ in the n^j^rity of reports of educational psychological 

stodi^r F ( 

y Pgrhaps thea6 observations may be useful with^respect to a related problem 

. < / 

now being fifeed^ in the academic world — the problem' of potential bl^s iA the 

/ ' ' ' * 

-prediction oygtades by thd use of- achievement ajid aptitude tests. I51 this 

c^se it may/be particularly impai;tant to consider carefully the basic assump- 

tions whim haVe been applied to th^ treatment of test score data.tC 



/ Validity studies such as. those 



of Cleary (1968) and analyses such as 
those at thomdike (1571) and Darlington (1971) use the .analysis of cpvSiriance 
model laii^ its assumptions regarding interval measurement* and linear relatibrf- 
0hipf ^ / Test "score distributions aiTe descriptions of the jrelationship between 



a t as )t -composed of a set of items' and a sample camposed>of a set of people. 
/It /can be demonstrated that a change in either set is likely to change the 
\ sfajape of the distribution* When tests are designed tjo maximize reliability 
Ijji jihe classical t^st-theory molel for a specified saanple of people, the mean 



■ //' 
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score will approach the middle of the possible-score range and the distribu- 
tipn will usually tend to be "symmetric. However,- whpn either" the -test items, 
ar peopltf are^rkedly changed so that tlie items are more difficult or the^ ' 
p'epple less s^ftb^sful, then the score (fistribution teada to be skewed, 
particulatly so when the isean sc£re Approaches th^ lower limit of possible 



scores. ' 
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In order to simplify the presentation of tfie analysis of covari^ncja-^ 
model it is common to represent score and criterion relationships by either 

■ ■ • ■ V 

a regression line or by ^ ellipse that represents contour lines of an 
. idealized biyariate. normal diicributiont Vhe center of the ellipse is located 
, at the coordinates of the means of the two^ualifarlatfi^distributions and tha 
regression line^asses througfc th^' center with a* slope less than one (when, 
as^l^ usual, scores scales are .drawn with equal standard deviations). Another 
liJie through the center with a slope of one my be drawn to represent the 
major diagonal of the ellipse. **This line represe;its,, among othfer things, the 
equipercentile relationship between the test and the criterion—the line of • 

, — * 

relitio^hip commonly used to d«scr±fr5Ttr4 ordinal relation'ship^ between two 
variables. (See Figure 1.) . * .* 

\ . . ,, _____ ■ ■ ^ ; 

Insert Figure 1 d^oxit h/re / • 



Now thfe^.ppint of this discussion'ds tjiat the equipercentile line of 
relationship caitrf^ver be identical with the r^ress ion line since tests are 
never perfectly reliable. Hence, if we are concerned with the relatfonship 
bietween two variable tck two distinct groups with 'different mean scores on 
eittier variable, then the interval-scale interpretation of the relationship 
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^ust dlf£e£ from the ordinal scale Interpretations Furthermore, in general, 
the ordinal interpretation will tend to be more favorable to the low scoring 
group." ' , \ 

The regression line and the equl$ercentlle line o£ relationship between 
a predictor test and a predicted criterion croas at the mean and median when 
the bivariate dlstributiorf is linear and symmetrical. For higher scores than 
the mean,, the regression line "will be below the equipercentile line and the ' 
predicted criterion score will be l«fwer thstn the equivalent equipercentile 
score. For scores below the mean, the relationship is reversed. If tjie 
bivariate^ distribution is irregular, skewed, qr curvilinear, Che two lined 
may not cross exaiPtly at the mean, but the same general t^lationship will 
'^still exist. ' If one is concenied with two distinguishable groups that are 
being* compared by means of the same predictor* and criterion, then either one 
of two things must occur. Either the regression linetfor the low scorlrfg 
group will be below the regression line for the high scoring group or the 
equipercentile lin% for the low scoring group yin be above the pther 
equipercentile line. Hence it is of considerable advantage to high scoring 
members of the low scoring group to have the^ equipercentile line used.^ 

It should be noted that the use of the eqtiipercentile line is, in the 

linear j case, the same strategy as Thomdike has described as Case C and ^ * / 

. I - ' . -.^ 

Darlington has despribed as Definition 2. However, it seems worthwhile to 

^ X- 

point out that these soiwtions may not only result in a^^airer" ititerpreta- 
tlon of scores in some sense, they may also avoid tlje^use' of assumptions that 
are perhaps sometimes difficult to justify or which may In fact not be 

5f*r ' ' 

justifiable at all. ^ ' - ^ ' 



Hominal scales, are properly summarized by the mode and'miiber 'of cate- 
gories. Ordinal scales are properly siiLarized by the median and the semi- 
interquartile range: the relationship between two ordinal scales is described 
properly by equipercentile procedures and by rank' cbrrel^t^p^dthods. Interval 
scales- are properly summarized by the mean and standard deviation or by the • 
mean and Variance; and the relationship between two or «re such scales is 
properly described by product-moment correlations and in some circumstances 
by art^lysis of 'covariaice. Studies of bias should validate, insofar as pbssible 
the assumptions that have been made in the model.. When there is reason to 
dqufct the l^^ear model, it would be best to examine the relationships that 
would obtain for ordinal interpretations of the data." In general, the oi^dinal 
interpretation will- be moje favorable to low scoring groups than is an inter- 
pretation based on the assumption of an interval scale. r ' • 
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